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Abstract 

In addition to the well known common properties such as small world and community structures, recent 
empirical investigations suggest a universal scaling law for the spatial structure of social networks. It 
is found that the probability density distribution of an individual to have a friend at distance r scales as 
P(r) oc r'^. The basic principle that yields this spatial scaling property is not yet understood. Here we 
propose a fundamental origin for this law based on the concept of entropy. We show that this spatial scaling 
law can result from maximization of information entropy, which means individuals seek to maximize the 
diversity of their friendships. Such spatial distribution can benefit individuals significantly in optimally 
collecting information in a social network. 
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Social networks structure is found to be important since it leads to deep insights about how peo- 
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pie interact and how social relations evolve [| 11-11411. It has been found that social networks possess 
common properties such as small-world and community structure [4]. Recently, geographical 
properties of social networks have attracted much attention 11 141 - 12711 . Several empirical studies 
have analyzed the distribution of distances between friends in real social networks. Liben-Nowell 
et al. explored the geographic properties in decentralized search within a large, online social net- 
work [I25I1 . They used data from the LiveJournal online community with about 500,000 members, 
in which their state and city of residence, as well as a list of their LiveJournal friends are available. 
They found that the probability density function (PDF), P{r), of an individual having a friend at a 
geographic distance r is about P{r) oc (see supplementary I). Almost at the same time, Adamic 
and Adar have also found the same phenomenon [16]. They investigated a relatively small social 
network, the Hewlett-Packard Labs email network. In this work, the PDF of the distance is also 
found to scale as P{r) oc More recently, Lambiotte et al. investigated a large mobile phone 



communication network The network consists of 2.5 million mobile phone customers that 
have placed 810 million communications, for whom they have the geographical home location 
information. Their empirical results show that the mobile phone communication network has the 
same scaling properties in the spatial structure. They found that probability of two nodes {u and v) 
to have a long range connection of length r{u, v) is Pr{u, v) oc r{u, v)~^. For 2-dimensional space, 
the number of nodes which have distance r from a given node is proportional to r. This implies 
that the PDF of an individual to have a friend at distance r is P{r) oc r ■ = . Very recently, 
Goldenberg and Levy investigated several large online communities, and also detected the same 



spatial scaling phenomenon ilSQ. From the above empirical investigations, one can conclude that 



the PDF of having a friend at distance r is 

P{r) ocr'\ (1) 

Why does the spatial structure of our social networks possess this kind of scaling property and 
how does it benefit us? Kleinberg has proved that in a J-dimensional space, when the probability 
of having a long range connection of length r between u and v is Pr{u, v) oc r{u, v) the network 



is optimally navigated [|26l- l29|l . For J-dimensional lattice, the number of nodes that have the same 
distance r to a given node is proportional to r"^ ^ So when the network structure is optimal for 
navigability, the PDF of the distance from a given node is P{r) oc r^^^ ■ r = r"' for all d. This 
spatial scaling property enables people to send messages efficiently in minimal number of hops to 



all nodes of the system. However, social networks are usually not constructed for the purpose of 
sending messages between unrelated individuals. Thus, there should be a fundamental origin that 
governs the formation of the spatial scaling law, Eq. (1). 

Here we suggest that the origin of this scaling, Eq. (1), comes from a general perspective 
based on the concept of entropy. We hypothesize that human social behavior is based on gathering 
maximum information through different activities. Making friends can be regarded as a way of 
collecting information. To get optimal information could be a general purpose for an individual 
that shapes the social network architecture. We will show that a social network based on Eq. ([T]) 
is an optimal network which can benefit people in collecting maximal information. 



I. MODEL 



To model a social system we use a toroidal lattice to denote the world in which each node 
represents an individual. We assume that each individual has a finite energy w which can be 
represented by the sum of distances between an individual and all his or her friends, 

m 

^ r{u, v) = w, (2) 

v=l 

where m is the number of direct links of node u. Eq. ^ implies that every node u selects its long 
range acquaintances v, one by one, until the total distance reaches w. 

The information that node v brings to u can be evaluated by considering the information of node 
V and all its neighbors. Thus, the information that u collects can be expressed by the sequence 
of nodes as illustrated in Fig. [Hand the entropy of the whole sequence measures the amount of 
information. We assume that all nodes are equivalent, so the information obtained by one node can 
represent the information obtained by each of the other nodes. Thus, our model for constructing a 
social network is 

n 

Max e = - ^Y^qi log qi, (3) 

!=1 

subjected to Eq. Q. In Eq. (O, qi denotes the frequency of node / in the information sequence (see 
Fig. 1) and n is the size of the network. When i is not a neighbor and not a next nearest neighbor 
of u, qi = 0, and we define qt log qi = 0. Here, Eq. ^ implies that the information entropy s is 
determined by the sequence of friends and friends of friends (For considering also friends of next 
nearest friends, see supplementary IIA). 
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FIG. 1: The friends of node 1. Node 2, 3 and 4 are the friends of node 1 which Eq. (2) yields that 
d{l,2) + d{l,3) + d{l,4) = w. The size of the network is n = 12 and the information sequence is 
{2, 3, 4, 5, 6, 7, 7, 8, 9, 9, 10) and the frequencies of all nodes are ^2 - ^3 - ^4 = ^5 - ^6 = ^8 = ^10 - ji, 
qj = qg = jy, qi = qii - qn - 0. If one site is reached several times when constructing the long range 
connections from node 1 or from its nearest neighbors, the node will appear in the node sequence and in 
Eq. (2) the same number of times. 

n. RESULTS 

Our optimization model (OM) is based on Eqs. Q and ([3]) which represent two competing 
processes. To maximize entropy (Eq. (O), it is preferred to have friends at long distances in order 
to explore new parts of the network and to obtain more information. However the farther one goes 
he can have less friends due to the finite energy limited by Eq. Q. Assuming the PDF of having 
a friend at distance r obeys 

P(r) oc r-", (4) 

we can explore the value of a that yields maximum entropy under the condition of Eq. 

The optimization model is simulated on a toroidal lattice whose size is L x L (L = 10000 
means that individuals can make friends in a population of 10^) and lattice ('Manhattan') distance 
is employed. Because toroidal lattice is a regular network and each node has a unique index, we 
can calculate the lattice distance between any pair of nodes and we do not need to construct the 
whole network, enabling us to simulate very large lattices. 

For a large enough 2-dimensional lattice, the number of nodes that have distance r from a 
given node is proportional to r. So if w ^ +00, that means if we consider the maximal diversity 
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of friendships without any constraints of energy, we expect P{r) oc r to be the optimal entropy 
information since each node has the same probability in the information sequence. In practice, 
individuals naturally have a limited energy w. Our numerical results shown in Fig. [2la indicate 
that when a ^ I, the information entropy s is near its maximum value for a very broad range of 
w. For the range w 6 (5 x 10"^, 10^) and / 6 (50, 1000), we find the optimal or to be a = 1 + 0.05. 

When the size of the lattice is L and P(r) oc r ^ the mean distance between friends is 
Therefore, we can find the average number of friends / to be 

(5) 

which gives one to one correspondence between / and w at the optimal state. When L = 10000 
and w 6 (5 X 10"^, 10^) the average number of friends is / e (50, 1000) which indeed corresponds 
to reality pOt] , In particular, when considering the average number of friends we contact in one 
year, / = 300 [|30|l . the optimal value of a is a = -0.99 ± 0.03 (as shown in Fig. |2l). 

Our results suggest that P(r) oc r"' is the optimal distribution for collecting information be- 
tween all power law distributions. Is P{r) oc r"^ the optimal distribution when considering all 
kinds of distributions? We will demonstrate, based on the following evolutionary model (EM), 
that among all kinds of distributions, P{r) oc r'^ is still the optimal one. In the EM, we also con- 
struct a network on a lattice of size LxL. A node w, is one of the neighbors of node u when there is 
a direct link from u to w,. Each node u has friends at distances r{u, ui) subject to Y^u/eu ^ 
where U is the set of all neighbors of node u. In the initial stage of the EM, P(r) is set to be a 
uniform distribution. Then we employ the extremal optimization method |3l|], to maximize the 
entropy through the evolution of network architecture. At each step, a node is chosen randomly. 
For a chosen node u, we make two operations, deleting and adding neighbors according to the 
marginal improvement of entropy. Suppose u has k neighbors. For the deleting execution, we 
first calculate the marginal entropies of each neighbor of node u, { 4-^, ■ ■ ■ , where 
A Eu- means the change in the entropy of node u when we delete node m, from the neighborhood 
of node u with other parameters being unchanged. Then we randomly select a comparatively 
small \^^\ with probability Pr{Ui) proportional to {rank\-^^\y^~^"^^^^ [131] and delete m, from w's 
neighborhood. For the adding link execution, suppose vi, V2, • • • , vi, are all the candidates which 



are currently next nearest neighbors of node u. We first calculate the marginal entropies of each 
of the candidate, {j^:;^, 7(^7^ ^ " ' " > r{uvi) '^' ^^^^ employ the extremal optimization method 

to choose a node whose marginal entropy is comparatively large among all candidates' marginal 
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FIG. 2: The relationship between e, w, /, a and L in the optimization model, a. The contour map shows the 
relationships between w, a and £, for L = 10000. The colors indicate the value of s. In b, the dependence of 
the information entropy e on o- for / = 300, 500, 1000 is shown, c. The dependence of the optimal a on the 
average number of friends /. The error bars denotes the standard deviations, d. The relationships between 
optimal a and the edge length L of the lattice. From it we can see that for large L the optimal a approaches 
1. The error bars denotes the standard deviations. 

entropies as a friend of node u. We repeat the adding execution until all the candidates are chosen 
or the energy limit (Eq. (2)) is satisfied. 

In the evolutionary model, we have to record all friends of each node and therefore a system 
of size Lx L with L = 10000 is too large to simulate. So we simulate the evolutionary model on 
a toroidal lattice of size 100 x 100. We assume that the energy scales linearly with distance as 
suggested by Eq. (2). Thus, when reducing L from 10,000 to 100 (factor of 100) we expect the 
corresponding energy to be reduced from order of 10^ to order of 10^. We therefore study the EM 
model of L = 100 with w = 1086 (/ = 50). 

In order to find the optimal distribution of the distances, we first employ the optimization model 
described by Eqs. (2)-(4) to analyze the above case with the system size 100 x 100 and w - 10^. 
We find that the maximum entropy is 7.18 and the corresponding a is a = 0.95 + 0.05 (see Fig. 
[3^, b). Next we simulate the evolutionary model of size of 100 x 100 and w - 10^. After long 
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FIG. 3: The results of evolutionary model when L - 100 and / - 50. a. The simulation results of OM on 
a toroidal lattice with the preset power law distribution P(r) oc r'" . b. the dependence of the information 
entropy e or a for / around 40 in the OM. We can see that when / = 50, the optimal exponent is 0.95 and 
it is very close to -1. c. The changes of entropy in the EM with the evolution time. The entropy is fixed 
and the system archives a steady state. The fixed entropy is 7.15 which is very close to the entropy 7.18 in 
the network of L = 100 which we preset the distribution is P(r) oc r"^. The inset denotes the difference of 
the time-entropy curve which implies that the difference decays exponentially. From it we can see that for a 
sufficient long time evolution, the entropy converges to a fixed value and the system achieves a steady state, 
d. The cumulative distribution of the distance in EM is shown in log-linear plot in the steady state. We can 
see that this distribution is very close to P{r) oc r""^ (dashed line). 

term evolution from the initial uniform distribution (each node modify the neighborhood more 
than 40000 times), the system achieves its stationary state (Fig. [3^). The maximum entropy is 
7.15 and the corresponding PDF of the distance between the friends scales as P{r) oc (Fig. [Sji 
and supplementary IIIB), which are very close to the results obtained by OM. So we conclude that 
P{r) oc is the optimal PDF of distances of friendships for collecting maximal information. It 
implies that, the spatial structure of the real social networks is the most optimal structure which 
leads to the maximum diversity of the friends' location and can help individuals to collect infor- 
mation efficiently. We note that, it can be proved analytically, under the assumption that the energy 
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scales linearly with system size, i.e. w = cL, for L —> +00, that P(r) oc will be the optimal 
distribution for maximizing entropy among all power law distributions (see supplementary IIC for 
detailed analysis ). 

m. CONCLUSION 

From the empirical results, we conclude that the probability distribution of having a friend 
at distance r scales as P(r) oc which is a universal spatial property for social networks. It is 
shown here that the origin of this spatial scaling law may result from the maximization of entropy 
which can benefit individuals for optimally collecting information. 
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Supplementary Information 



IV. EXPLANATION FOR SPATIAL SCALING OF LIVEJOURNAL 

n 

In the empirical study of the LiveJournal data set [1], for each distance r, Q(r) is the fraction 
of friendships among all pairs u, v of LiveJournal users with r(u,v) = r. Q(r) = |^ oc r"^ 
Here, F{r) denotes the total number of friendships with distance r and S (r) is the total number 
of pairs of nodes that have distance r. The LiveJournal social network has a fractal dimension of 
about 0.8 (they define the fractal dimension of a network as the exponent d of the best-fit function 
rankuiv) = c ■ r{u, vY, where ranku{v) is the number of people who live closer to u than v and 
c is a constant). We know that for any J-dimensional lattice, the number of nodes that have the 
same distance r to a given node is proportional to r^~K In fractal networks, d should be the fractal 
dimension. Thus the probability density function P{r) of the geographic distance r between friends 
is about P{r) oc r''"^ • Q(r) = r" *^"^ • r'^ = r'^-^ , which is close to r"^ 

V. ABOUT THE OPTIMIZATION MODEL (OM) 

A. Why We Only Consider Friends and Next Nearest Friends? 

We assume that the information obtained from the social network is actually related with the 
influence of friendships. Indeed, in our social life, our friends always talk something about their 
friends. Thus, we assume that friends and next nearest friends are most important and is enough 
to consider them in our model. However, Christakis and Fowler have found recently that the 
influence is mainly within three degrees of separation and call this finding the "Three Degrees of 
Influence Rule" ||2j]. It is computationally difficult to take into account more than two degrees of 
separation of friends to study a system of 10"^ x 10"^. We have therefor performed the numerical 
experiments of the OM in 3000 x 3000 size lattice with w - W (/ = 300) and found that the 
simulated results were similar when we took into account friends and next nearest friends, and 
three degrees of separation (as shown in Fig. |4]). 
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FIG. 4: The relationship between entropy and the power law exponent in different degrees of influence. The 
lattice size is 3000 x 3000,/ = 300. We can see that the phenomena are similar in which -1 is close to the 
optimal exponents. 

B. Algorithm of OM 

When the lattice size is 10000 x 10000, it is hard to record all nodes' links information. Thus, 
we first represent each node an index running from 1 to 10^. This way is easy to obtain a function 
r{u, v) to calculate the lattice distance between any pair of nodes u and v, where u, v are now the 
running index. 

In the OM model all nodes are equivalent. Without losing generality, we can set any node as 
M = 1. To construct the spatial network on the lattice, each time we first randomly generate a 
distance r according to the distribution P(r) oc r r 6 {1,2, •• - L}. Then from the set of nodes 
which have distance r from node 1, a node is chosen randomly as a friend of node 1 and a directed 
link is constructed. Repeating the execution until the energy achieves the limit constraint. After 
the executions we can get all the friends of node 1 . Employing the same approach, we can also get 
all the next nearest friends of node 1. 

C. Analysis on OM 

In this section we will prove that if energy hold 



w = cL, 



(6) 
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where c is constants, for L +00, P{r) oc r Ms the optimal distribution for all P(r) oc r " 
distributions. 

1. Symbol and Expression Descriptions 

P(r) oc r the distribution of distance between friendships. 
Ra , the expectation of the distance which holds P{r) oc r~". 

= ^ is the expectation of number of friends. 
When w = i^^> L is the edge length of the lattice, / denotes the number of friends when 
a=l. 

q"j denotes the probability of the connection between node / and j for a given a. 

pa _ 6^, - ■ ■ , ffi }, denotes the set of friends of node 1, where = ^. 

Hfa i = , q'L ; denotes the probability that node / is one of friends of F". 

Z v-1 ^ log 4C^2?Fa ,(1 ;) " ^ denotes the expectation of entropy of node i when the chosen 

Ja Ja fa ' ' 

probability of node / is ^/ra,, and the time of choosing is 

= Z"=i Zr=i ^ log jiC^fi^lF'' ~ (lF'',d "~^, denotes the expectation of entropy for a given 

J a J a Ja ' 

E(£a), denotes the expectation £„ 

2. Case 1: a < I 



^ ^ x'-'^dx + 0(1) ^ ^(L^- - 1) + 0(1) ^ i_-a^ 
" f'x-^'dx + 0(1) - 1) + 0(1) ■ 

Therefore, for a given w = cL, where c is a constant, we have 

lim fa = lim — = — —. (8) 

L-^oo L->oo I — a 

Because, 



lim maxi/f, < lim — = lim — ; 

''J L^oo JL ^_a^^ ^ ^^^^ L^oo _i_(Ll-« - 1) + 0(1) 



= (9) 

- Jj"x-'^Jx+0(l) Tz^(^^-" - i) + 0'(i; 

and 



^fa,- < max^°^^.. (10) 



13 



Thus, for any F", 



It implies that 



Thus 



lim qpc^i = 0. (11) 

L— >oo 



, .c(2-a) c(2-a) ^^^^ 
km = log(^; + {-\ '-f). (12) 



V 17/ ^ 1 c(2-a) 2, .^^^ 
km £(e„) = log(— + [— ] ), (13) 

i-^oo I - a \ - a 



which is a monotonic increasing function with or < 1. 
3. Case 2: a > \ 

Lemma: if ^ 6 (0, \), for any large enough z we have 

z 

- ^ log ^ > - y - log -Clq\\ - qr\ (14) 

where - 2^=i f log -^C^q^il-qY'^ denotes the expectation of entropy of a node with the probability 
q to be chose and the total choosing time is z (as shown in Fig. |5]). 
Proof: 

According to Law of Large Numbers, lim^^oo - Zv=i f log fCf<?^(l - 4)^ '^ = ~^log<?. 
Thus, we just need to prove 

g{z) = - V - log -qq\l - qf-^ (15) 

is a monotonic increasing function. 

For large enough z, normal distribution is a well approximation to binomial distribution then 
we have 

1 -(-r-rt^ X X 

g{z)= —e^^ -\og-dx, (16) 

Ji o-yln z z 

where cr^ = zq{l - q),H = z<?. 



gXz) = , [log -{q^z^ - 3q^z + 3qz - x^) - Izq^ + 2zq\e^^^xdx (17) 



Az^q{l - q) ^Jnzq{l - q) 
Obviously, 



> (18) 



4^3^(1 - q) ^JnzqiT^^ 
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FIG. 5: Plot ofy = -q log q - g{z). From the plot we can see that Lemma is true. More over when z is small 
g{z) > -q log q is also correct. 



and 



ebgid-Dx > 



(19) 



More over 



fuog- 
Jl z 



{qh^ - 3q^z + 3qz - x^) - Izq^ + lzq\dx = - q^)^ + ©(z^ log z) > (20) 



when q < \, where, 0(z^ log z) denotes the same order of log z- 

Thus, gXz) > which implies that g(z) is a monotonia increasing function and 

- ^log^ > - y - log -c,v(i - ^r^ 

^—t 7 7 



JC=1 



(21) 



For case 2, according to Lemma and Levy stable distribution property (the distance between 
the next nearest neighbor and the origin is also obey P(r) a r " when a > 1). So for large enough 
friends number we have: 

EisJ < £ ^rj^ log (22) 



4rZia) ArZ{a) 



1 ^ 

= ^ - 1) log - log[4Z(a)]}. 

Z(Qr) 

r=l 



More over we can get: 



lim £'(£0,) 

L—i+OQ 



{a - l)(21og2 + logZ(a)) + a + 1 
2{a - 1)2 



(23) 



(24) 



15 



where Z(a) denotes 2r=i ^ Obviously, i)(2iog2+iogZ(a))+a+i ^ monotonic increasing function. 
Thus, for any fixed c, -1 is the optimal exponent. 

VI. ABOUT THE EVOLUTIONARY MODEL (EM) 

A. Why we chose new friend only from the next nearest neighbors? 

There are 2 reasons. The first is that, according to our real social experience, we always make 
some new friends who are the friends of our friends. The second is that EM is a global optimal 
algorithm. Thus if we choose any node as our new friend, the result will be the same theoretically. 

B. How to Measure the Power Law Exponent in EM? 

To accurately measure the exponent value of power law distribution is not a easy work. Es- 
pecially, when the exponent is very close to -1. We use the least square method to evaluate the 
exponent value. We are afraid the least square method is not a good way, so we plot the accu- 
mulated curve. Fortunately, it can be proved that when P{r) oc r"\ the accumulated function in 
log-linear plot will be a straight line. We can see that the distribution is about P{r) oc . 



[1] Liben-Nowell, D., Novak, J., Kumar, R., Raghavan, R and Tomkins, A. Geograph routing in social 
networks. Proc. Natl. Acad. 102, 11623-11628 (2005). 



[2] http://www.calit2.net/newsroom/article.php?id=1558 and the up coming book: Christakis and Fowler: 
Connected: The Surprising Power of Social Networks and How They Shape Our Lives, Little Brown, 
(2009). 



16 



